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Abstract — We consider a windowed decoding scheme for LDPC 
convolutional codes that is based on the belief-propagation (BP) 
algorithm. We discuss the advantages of this decoding scheme 
and identify certain characteristics of LDPC convolutional code 
ensembles that exhibit good performance with the windowed 
decoder. We will consider the performance of these ensembles 
and codes over erasure channels with and without memory. We 
show that the structure of LDPC convolutional code ensembles 
is suitable to obtain performance close to the theoretical limits 
over the memoryless erasure channel, both for the BP decoder 
and windowed decoding. However, the same structure imposes 
limitations on the performance over erasure channels with 
memory. 

Index Terms — Low-density parity-check codes, Convolutional 
codes, Iterative decoding, Windowed decoding, Belief propaga- 
tion, Erasure channels, Decoding thresholds, Stopping sets. 



I. Introduction 

LOW-DENSITY parity-check (LDPC) codes, although in- 
troduced in the early 1960's [4|, were established as state- 
of-the-art codes only in the late 1990's with the application 
of statistical inference techniques Q to graphical models 
representing these codes (6J, |7). The promising results from 
LDPC block codes encouraged the development of convolu- 
tional codes defined by sparse parity-check matrices. 

LDPC convolutional codes (LDPC-CC) were first intro- 
duced in [8]. Ensembles of LDPC-CC have several attractive 
characteristics, such as thresholds approaching capacity with 
belief -propagation (BP) decoding [9], and BP thresholds close 
to the maximum a-posteriori (MAP) thresholds of random 
ensembles with the same degree distribution [10]. Whereas 
irregular LDPC block codes have also been shown to have BP 
thresholds close to capacity [11] , the advantage with convo- 
lutional counterparts is that good performance is achieved by 
relatively simple regular ensembles. Also, the construction of 
finite-length codes from LDPC-CC ensembles can be readily 
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optimized to ensure desirable properties, e.g. large girths 
and fewer cycles, using well-known techniques of LDPC 
code design. Most of these attractive features of LDPC-CC 
are pronounced when the blocklengths are large. However, 
BP decoding for these long codes might be computationally 
impractical. By implementing a windowed decoder, one can 
get around this problem. 

In this paper, a windowed decoding scheme brought to 
the attention of the authors by Liva fT2) is considered. This 
scheme exploits the convolutional structure of the parity-check 
matrix of the LDPC-CC to decode non-terminated codes, 
while maintaining many of the key advantages of iterative 
decoding schemes like the BP decoder, especially the low 
complexity and superior performance. Note that although 
similar decoding schemes were proposed in ]T3) , p4) , the 
aim in these papers was not to reduce the decoding latency or 
complexity. When used to decode terminated (block) LDPC- 
CC, the windowed decoder provides a simple, yet efficient 
way to trade-off decoding performance for reduced latency. 
Moreover, the proposed scheme provides the flexibility to set 
and change the decoding latency on the fly. This proves to 
be an extremely useful feature when the scheme is used to 
decode codes over upper layers of the internet protocol. 

Our contributions in this paper are to study the requirements 
of LDPC-CC ensembles for good performance over erasure 
channels with windowed decoding (WD). We are interested 
in identifying characteristics of ensembles that present a good 
performance-latency trade-off. Further we seek to find such 
ensembles that are able to withstand not just random erasures 
but also long bursts of erasures. We reiterate that we will be 
interested in designing ensembles that have the aforementioned 
properties, rather than designing codes themselves. Although 
the channels considered here are erasure channels, we note that 
the WD scheme can be used when the transmission happens 
over any channel. 

This paper is organized as follows. Section [H] introduces 
LDPC convolutional codes and the notation and terminology 



that will be used throughout the paper. In Section III we de- 
scribe the decoding algorithms that will be considered. Along 
with a brief description of the belief-propagation algorithm, we 
will introduce the windowed decoding scheme that is based 
on BP. Possible variants of the scheme will also be discussed. 



Section IV deals with the performance of LDPC-CC on the 



binary erasure channel. Starting with a short recapitulation of 
known results for BP decoding, we will discuss the asymptotic 
analysis of the WD scheme in detail. Finite-length analysis 
will include performance evaluation using simulations that 
reinforce the observations made in the analysis. For erasure 
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channels with memory, we analyse LDPC-CC ensembles both 
in the asymptotic setting and for finite lengths in Section[V] We 
also include simulations illustrating the good performance of 
codes derived from the designed protographs over the Gilbert- 
Elliott channel. Finally, we summarize our findings in Section 

eh 

II. LDPC CONVOLUTIONAL CODES 

In the following, we will define LDPC-CC, give a construc- 
tion starting from protographs, and discuss various ways of 
specifying ensembles of these codes. 

A. Definition 

A rate R = b/c binary, time-varying LDPC-CC is defined 
as the set of semi-infinite binary row vectors vr^i, satisfying 
TT 
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and Ofoo] is the semi-infinite all-zero row vector. The elements 
Hj(t), i = 0, 1, ••• ,m s in ([T| are binary matrices of size 
(c - b) x c that satisfy Q3) 

. Hj(t) = 0, for i < and i > m s , V t > 1 

. 3 t > such that H ms (i) ^ 

. H (t) has full rank V °t > 1. 
The parameter m s is called the memory of the code and i/ a = 
(m s + l)c is referred to as the constraint length. The first 
two conditions above guarantee that the code has memory m s 
and the third condition ensures that the parity-check matrix 
is full-rank. In order to get sparse graph codes, the Hamming 
weight of each column h of H^j must be very low, i.e., 
Wii(h) -C v s . Based on the matrices Hi(t), LDPC-CC can be 
classified as follows |8|. An LDPC-CC is said to be periodic 
if H,;(t) = Hj(i + t) V i = 0, 1, • • • , m s , V t and for some 
t > 1. When t = 1, the LDPC-CC is said to be time-invariant, 
in which case the time dependence can be dropped from the 
notation, i.e. Hj(t) = V i = 0, 1, • • • ,m s , V i. If neither 
of these conditions holds, it is said to be time-variant. 

Terminated LDPC-CC have a finite parity-check matrix 
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where we say that the convolutional code has been terminated 
after L instants. Such a code is said to be (J, K) regular if 
Hm has exactly J l's in every column and K l's in every row 
excluding the first and the last m s (c — b) rows, i.e. ignoring 
the terminated portion of the code. It follows that for a given 
J, the parity-check matrix can be made sparse by increasing c 
or m s or both, leading to different code constructions fT6| . In 
this paper, we will consider LDPC-CC characterized by large 
c and small m s . As in |9j, we will focus on regular LDPC-CC 
which can be constructed from a protograph. 

B. Protograph-based LDPC-CC 

A protograph JT7J is a relatively small bipartite graph from 
which a larger graph can be obtained by a copy-and-permute 
procedure — the protograph is copied M times, and then the 
edges of the individual replicas are permuted among the M 
replicas to obtain a single, large bipartite graph referred to as 
the derived graph. We will refer to M as the expansion factor. 
M is also referred to as the lifting factor in literature fTT) . 
Suppose the protograph possesses Np variable nodes (VNs) 
and Mp check nodes (CNs), with degrees Jj, j — 1, • • • , Np, 
and Ki, i = 1, • • • , Mp, respectively. Then the derived graph 
will consist of n = N P M VNs and m = M P M CNs. The 
nodes of the protograph are labeled so that if the VN Vj is 
connected to the CN Cj in the protograph, then Vj in a replica 
can only connect to one of the M replicated CVs. 

Protographs can be represented by means of an Mp x Np bi- 
adjacency matrix B, called the base matrix of the protograph 
where the entry Bjj represents the number of edges between 
CN C.- L and VN Vj (a non-negative integer, since parallel edges 
are permitted). The degrees of the VNs (CNs respectively) of 
the protograph are then equal to the sum of the corresponding 
column (row, respectively) of B. A (J, K) regular protograph- 
based code is then one with a base matrix where all VNs 
have degree J and all CNs, excluding those in the terminated 
portion of the code, have degree K. 

In terms of the base matrix, the copy-and-permute operation 
is equivalent to replacing each entry B,-j in the base matrix 
with the sum of Bj j distinct size-M permutation matrices. 
This replacement is done ensuring that the degrees are main- 
tained, e.g., a 2 in the matrix B is replaced by a matrix 
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(M) 
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where P^ 1 ' 1 and "P^" 1 ' ,axe two 



(M) 



permutation matrices of size M chosen to ensure that each 
row and column of H^*^ has two ones. The resulting matrix 
after the above transformation for each element of B, which 
is the biadjacency matrix of the derived graph, corresponds 
to the parity-check matrix H of the code. The derived graph 
therefore is nothing but the Tanner graph corresponding to the 
parity-check matrix H of the code. 

For different values of the expansion factor M, different 
blocklengths of the derived Tanner graph can be achieved, 
keeping the original graph structure imposed by the pro- 
tograph. We can hence think of protographs as defining 
code ensembles that are themselves subsets of random LDPC 
code ensembles. We will henceforth refer to a protograph 
B and the ensemble C it represents interchangeably. This 
means that the density evolution analysis for the ensemble 
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of codes represented by the protograph can be performed 
within the protograph. Furthermore, the structure imposed 
by a protograph on the derived graph can be exploited to 
design fast decoders and efficient encoders. Protographs give 
the code designer a refined control on the derived graph edge 
connections, facilitating good code design. 

Analogous to LDPC block codes, LDPC-CC can also be 
derived by a protograph expansion. As for block codes, 
the parity-check matrices of these convolutional codes are 
composed of blocks of size-Af square matrices. We now give 
two constructions of (J, K) regular LDPC-CC ensembles. 

1) Classical construction: We briefly describe the con- 
struction introduced in fl8) . For convenience, we will refer to 
this construction as the classical construction of (J, K) regular 
LDPC-CC ensembles. Let a be the greatest common divisor 
(gcd) of J and K. Then there exist positive integers J' and K' 
such that J = aJ', K = aK' , and gcd( J', K') = 1. Assuming 
we terminate the convolutional code after L instants, we obtain 
a block code, described by the base matrix 
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where m s = a — 1 is the memory of the LDPC-CC and , i — 
0, • • • , m s are J' x K' submatrices that are all identical and 
have all entries equal to 1. Note that an LDPC-CC constructed 
from the protograph with base matrix B[ £ j could be time- 
varying or not depending on the expansion of the protograph 
into the parity-check matrix. 

The protograph of the terminated code has Np — LK' 
VNs and M P = (L + m s )J' CNs. The rate of the LDPC-CC 
is therefore 



Rl = 1 



L 



L 



K> 
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L 
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where R = 1 — 
Note that R L -> 
distribution (9) 



K 1 



is the rate of the non-terminated code. 
> R and the LDPC-CC has a regular degree 
when L — » oo. We will assume that the 



parameters satisfy K' > J' and L > ^j^m s so that the 
rates R and Rl of the non-terminated and terminated codes, 
respectively, are in the proper range. 

The classical construction was proposed in fl8) and it 
produces protographs for some (J,K) regular LDPC-CC 
ensembles. However, not all (J,K) regular LDPC-CC can be 
constructed, e.g. m s becomes zero if J and K are relatively 
prime and consequently the resulting code has no memory. In 
J5), the authors addressed this problem by proposing a con- 
struction rule based on edge spreading. We denote an ensemble 
of (J, K) regular LDPC-CC constructed as described here as 
C C (J,K) with the subscript c for "classical" construction. 



2) Modified construction: We propose a modified con- 
struction that is similar to the classical construction except 
that we do not require that m s = a — 1, i.e. the memory 
of the LDPC-CC is independent of its degree distribution. 
We further disregard the requirement that the B^ matrices 
are identical and have only ones, i.e. parallel edges in the 
protograph are allowed. However, the sizes of the submatrices 
B i; i = 0, 1, • • • , m s will still be J' x K'. We will denote a 
(J, K) regular LDPC-CC ensemble constructed in this manner 
as C m (J,K), with subscript m for "modified" construction. 
Note that the rate of the C m (J, K) ensemble is still given by 
Equation |2]). Further, the independence of the code memory 
and the degree distribution allows us to construct LDPC-CC 
even when J and K are co-primes. This is illustrated in the 
following example. 

Example 1: Let J = 3 and K = 4. Clearly, a classical 
construction of this ensemble is not possible. However, with 
the modified construction, we can set m s = 1 and define the 
ensemble C m (J,K) given by 



Bn 




Bi 



10 
10 1 
10 



with design rate Rl = 1 — | ( ^77^) for a termination length L. 
Note that these submatrices are by no means the only possible 
ones. Another set of submatrices satisfying the constraints is 



2 1 
B = ( 2 1 
2 



Bx = 



10 
1 
110 



□ 



The above example brings out the similarity between the 
proposed modified construction and the technique of edge 
spreading employed in [9|, wherein the edges of the proto- 
graph defined by the matrix 




are "spread" between the matrices Bo and Bi (or between Bo 
and Bi) to obtain a (3,4) regular LDPC-CC ensemble with 
memory m s = 1. The advantage of the modified construction 
is thus clear — it gives us more degrees of freedom to design the 
protographs in comparison with the classical construction. In 
particular, the ensemble specified by the classical construction 
is contained in the set of ensembles allowed by the modified 
construction, meaning that the best performing C m (J,K) 
ensemble (with memory the same as that of the C C (J,K) 
ensemble) is at least as good as the C C (J, K) ensemble. Note 
that in |9|, there was no indication as to how edges are to be 
spread between matrices. With windowed decoding, we will 
shortly show that different protographs (edge spreadings) have 
different performances. We will also identify certain design 
criteria for efficient modified constructions that suit windowed 
decoding. 
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C. Polynomial representation of LDP C-CC ensembles 

We have thus far specified LDPC-CC ensembles by giving 
the parameter L and the matrices Bj, i = 0, 1, • • • , m s . An al- 
ternative specification of terminated protograph-based LDPC- 
CC ensembles using polynomials is useful in establishing 
certain properties of (J, K) regular ensembles and is described 
below. 

Instead of specifying (m s +l) matrices B; of size J' x K' , 
we can specify the K' columns of the (m s + l)J'xK' matrix 



(o) , (i) . 

P 2 +P2 X + - 



-p^ x T 
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using a polynomial of degree no more than d = (rn s + 1)J' — 1 
for each column. The polynomial of the j th column 

f \ (0) , (1) , (2) 2 i i (d) d 

Pj{x) = pj +p) 'x +p\ x H \-p\ 'x (3) 

is defined so that the coefficient of x\ Pj, is the j) entry 
of Bp] for all i = 0, 1, • • • , d and j = 1, 2, • • • , K' . Therefore, 
an equivalent way of specifying the LDPC-CC ensemble is by 
giving L and the set of polynomials {pj (x),j = 1, 2, • • ■ , K'}. 
With this notation, the I th column of Bm is specified by 
the polynomial x J l Pj(x) where I = iK' + j for unique 
< i < L — 1 and 1 < j < K' . We can hence use "the 
column index" and "the column polynomial" interchangeably. 
Further, to define (J, K) regular ensembles, we will need the 
constraints 

Pj (l) = JV l<j<K' 



and 



A" 



YjPji 1 ) =KV0<m<J'-l, 

3=1 

where p'" 1 ' (x) is the polynomial of degree no larger than m s 
obtained from Pj(x) by collecting the coefficients of terms 
with degrees I where I = hJ' + m for some < h < m s , i.e. 
I = m(mod J'): 



[ml / \ (m) 

p) (x) =p) + p 



IJ'+m), 



(m s J'+m) ms 
Pj X 



E(hJ'+rn) h 
Pj X ■ 

h=0 



(4) 



We will refer to these polynomials as the modulo polyno- 
mials. Let us denote the set of polynomials defining an 
LDPC-CC ensemble as V = {Pj(x),j £ [iT]}, where 
[K 1 ] = {1,2, ■ ■ ■ , K'}, and the modulo polynomials as 
Vi = {pf(x),j E [K% l = 0, 1, • • • , J' - 1. Later in the 
paper, we will say "the summation of polynomials Pi(x) and 
Pj{x)" to mean the collection of the i th and the j th columns 
of Bji]. The following example illustrates the notation. 

Example 2: For (J, 2 J) codes, we have J' = 1 and K' = 2, 
the component base matrices Bj,i = 0, ...,m s are 1x2 
matrices. With the first column of the protograph Bm, we 
associate a polynomial pi(x) = p^ +p^x + ■ ■ ■ +p^ 3 ^ x m " 
of degree at most m s . Similarly, with the second column we 



associate a polynomial p 2 {x) 

also of degree at most m s . Then, the (2i + l) th column of 
Bm can be associated with the polynomial x 1 pi(x), and the 
(2i + 2) th column with the polynomial x l p2(x). As noted 
earlier, we will use the polynomial of a column and its index 
interchangeably, e.g. when we say "choosing the polynomial 
x l p\(x)" we mean that we choose the (2i + l) th column of 
Brj,]. Similarly, by "summations of polynomials p%{x) and 
P2(x)" we mean the collection of the corresponding columns 
of B[£j. In order to define (J, 2J) regular ensembles, we will 
further have the constraint pi(l) = P2(l) = J- In this case, 
since J' = 1, p± 0-) = is the same as the previous 

constraint, because pf\l) +p 2 °\l) — Pi(l) +P2(1)- □ 

We define the minimum degree of a polynomial a(x) as the 
least exponent of x with a positive coefficient and denote it as 
mindeg(a(x)). Clearly, < mindeg(a(x)) < deg(a(x)). Let 
us define a partial ordering of polynomials with non-negative 
integer coefficients as follows. We write a(x) -< b(x) if 
mindeg(a(a;)) = mindeg(6(a;)), deg(a(a;)) = deg(b(x)) and 
the coefficients of a(x) are no larger than the corresponding 
ones of b(x). The ordering -< satisfies the following properties 
over polynomials with non-negative integer coefficients: if 
a(x) < b(x) and c(x) -< d(x), then 

a(x) + c(x) r< b{x) + d(x) 
a(x)c(x) ^ b(x)d(x). 

We define the boundary polynomial /3(a(x)) of a polynomial 
a{x) to be /3(a(x)) — x % + x^ where i = mindeg(a(x)) and 
j = deg(a(x)). Note that when i = j, we define (3(a(x)) = 
x 1 . We have for any polynomial a(x), f3(a(xj) ^ a(x). 

III. Decoding Algorithms 

LDPC-CC are characterized by a very large constraint 
length v s = (m s + 1)K'M. Since the Viterbi decoder has a 
complexity that scales exponentially in the constraint length, 
it is impractical for this kind of code. However, the sparsity 
of the parity-check matrix can be exploited and an iterative 
message passing algorithm can be adopted for decoding. We 
consider two specific iterative decoders here — a conventional 
belief -propagation decoder (6j, fl9) and a variant called a 
windowed decoder. 



A. Belief-Propagation (BP) 

For terminated LDPC-CC, decoding can be performed as 
in the case of an LDPC block code, meaning that each frame 
carrying a codeword obtained through the termination can be 
decoded with the sum-product algorithm (SPA) |19|. 

Note that since the BP decoder can start decoding only after 
the entire codeword is received, the total decoding latency 
Abp is given by Abp = T cw + T c i ec , where T cw is the time 
taken to receive the entire codeword and Td ec is the time 
needed to decode the codeword. In many practical applications 
this latency is large and undesirable. Moreover, for non- 
terminated LDPC-CC, a BP decoder cannot be employed. 
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Fig. 1. Illustration of windowed decoding (WD) with window of size W = 4 for a C m ( J, 2J) LDPC-CC with m s = 2 and L = 16 at the fourth decoding 
instant. This window configuration consists of Jyy = WJ'M = 4M rows of the parity-check matrix and all the (W + m a )K' M = 12M columns involved 
in these equations: this comprises the red (vertically hatched) and the blue (hatched) edges shown within the matrix. Note that the symbols shown in green 
(backhatched) above the parity-check matrix have all been processed. The targeted symbols are shown in blue (hatched) above the parity-check matrix and 
the symbols that are yet to be decoded are shown in gray above the parity-check matrix. 



B. Windowed Decoding (WD) 

The convolutional structure of the code imposes a constraint 
on the VNs connected to the same parity-check equations — 
two VNs of the protograph that are at least (m s + 1)K' 
columns apart cannot be involved in the same parity-check 
equation. This characteristic can be exploited in order to 
perform continuous decoding of the received stream through 
a "window" that slides along the bit sequence. Moreover, 
this structure allows for the possiblity of parallelizing the 
iterations of the message passing decoder through several 
processors working in different regions of the Tanner graph. 
A pipeline decoder based on this idea was proposed in |8). 
In this paper we consider a windowed decoder to decode 
terminated codes with reduced latency. Note that whereas 
a similar sliding window decoder was used to bound the 
performance of BP decoding in fl4| , we are interested in 
evaluating the performance of the windowed decoder from a 
perspective of reducing the decoding complexity and latency. 

Consider a terminated {J,K) regular parity-check matrix 
H built from a base matrix B. The windowed decoder works 
on sub-protographs of the code and the window size W is 
defined as the number of sets of J' CNs of the protograph B 
considered within each window. In the parity-check matrix H, 
the window thus consists of Jw = WJ'M — W(c — b) rows 
of H and all columns that are involved in the check equations 
corresponding to these rows. We will henceforth refer to the 
size of the window only in terms of the protograph with the 
corresponding size in the parity-check matrix implied. The 
window size W ranges between (m s + l) and (L— 1) because 
each VN in the protograph is involved in at most J'(m s + 1) 
check equations; and, although there are a total of Mp = 
J'(L + m s ) CNs in B, the decoder can perform BP when all 
the VN symbols are received, i.e. when L < W < L + m s . 



Apart from the window size, the decoder also has a (typically 
small) target erasure probability 5 > as a parameter! The 
aim of the WD is to reduce the erasure probability of every 
symbol in the codeword to a value no larger than 5. 

At the first decoding instant, the decoder performs belief- 
propagation over the edges within the window with the aim 
of decoding all of the first K' symbols in the window, called 
the targeted symbols. The window slides down J' rows and 
right K' columns in B after at least a fraction (1 — 8) of 
the targeted symbols are recovered (or, in general, after a 
maximum number of belief-propagation iterations have been 
performed), and continues decoding at the new position at the 
next decoding time instant. 

We refer to the set of edges included in the window at 
any particular decoding time instant as the window config- 
uration. In the terminated portion of the code, the window 
configuration will have fewer edges than other configurations 
within the code. Since the WD aims to recover only the 
targeted symbols within each window configuration, the entire 
codeword is recovered in L decoding time instants. Fig. [T] 
shows a schematic representation of the WD for W = 4. 

The decoding latency of the K' targeted symbols with WD 
is therefore given by Awd = Tw + Tdec(W), where T\y is 
the time taken to receive all the symbols required to decode 
the K' targeted symbols, and Td ec (W) is the time taken to 
decode the targeted symbols. The parameters T cw and Tw are 
related as 

_ (W + m^ _W±m 1 

IV j- jr. cw T ± CW ) 

LA L 

since at most (W + m s )K' symbols are to be received to 
process the targeted symbols. The relation between Td ec and 

'We will see shortly that setting 6 = is not necessarily the most efficient 
use of the WD scheme. 
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T d ec{W) is given by 

W 

T dec {W) = —T dec , 

since the complexity of BP decoding scales linearly in block- 
length and the WD uses BP decoding over WK' symbols 
in each window configuration. We assume that the number 
of iterations of message passing performed is fixed to be the 
same for the BP decoder and the WD. Thus, in latency-limited 
scenarios, we can use the WD to obtain a latency reduction 
of 

. ^ W + m s a . 
A-wd < ^ Asp = wA B p- 

The smallest latency supported by the code-decoder system 
is therefore at most a fraction w m i n = 2m £ +1 that of the 
BP decoder. As pointed out earlier, the only choice for non- 
terminated codes is to use some sort of a windowed decoder. 
For the sequence of ensembles indexed by L, with the choice 
of the proposed WD with a fixed finite window size W, 
the decoding latency vanishes as O(j^). We will typically be 
interested in small values of W where large gains in decoding 
latencies are achievable. Since the decoding latency increases 
as W increases, the trade-off between decoding performance 
and latency can be studied by analyzing the performance of 
the WD for the entire range of window sizes. 

Latency Flexibility: Although reduced latency is an impor- 
tant characteristic of WD, what is perhaps more useful practi- 
cally is the flexibility to alter the latency with suitable changes 
in the code performance. The latency can be controlled by 
varying the parameter W as required. If a large latency can be 
handled, W can be kept large ensuring good code performance 
and if a small latency is required, W can be made small while 
paying a price with the code performance (We will see shortly 
that the performance of WD is mono tonic in the window size). 

One possible variant of WD is a decoding scheme which 
starts with the smallest possible window size and the size 
is increased whenever targeted symbols cannot be decoded, 
i.e., the target erasure probability cannot be met within the 
fixed maximum number of iterations. Other schemes where 
the window size is either increased or decreased based on the 
performance of the last few window configurations are also 
possible. 

IV. Memoryless Erasure Channels 

In this section, we confine our attention to the performance 
of the LDPC-CC when the transmission occurs over a mem- 
oryless erasure channel, i.e. a binary erasure channel (BEC) 
parameterized by the channel erasure rate e. 

A. Asymptotic analysis 

We consider the performance of the LDPC-CC in terms of 
the average performance of the codes belonging to ensembles 
defined by protographs in the limit of infinite blocklengths 
and in the limit of infinite iterations of the decoder. As in the 
case of LDPC block codes, the ensemble average performance 
is a good estimate of the performance of a code in the 



ensemble with high probability. We will therefore concentrate 
on the erasure rate thresholds (TTJ of the code ensembles 
as a performance metric in our search for good LDPC-CC 
ensembles. 

1 ) BP: The asymptotic analysis of LDPC block codes with 
the BP decoder over the BEC has been well studied [20|-|23|. 
For LDPC-CC based on protographs, the BP decoding thresh- 
olds can be numerically estimated using the Protograph-EXIT 
(P-EXIT) analysis [ 24 1 . This method is similar to the standard 
EXIT analysis in that it tracks the mutual information between 
the message on an edge and the bit value corresponding to the 
VN on which the edge is incident, while maintaining the graph 
structure dictated by the protograprj^] 

The processing at a CN of degree dc results in an updating 
of the mutual information on the e?* edge as 

Iout,d c = C (iin,lj " ' ' j-^in,d c -l) = I'm,i (5) 

and the corresponding update at a VN of degree dy gives 

d v -l 

Iout,d v — V (Ich, linS, ■ ■ ■ , -fin, d v -l) = 1 — £ ]^[ (1 hnd) 

i=l 

(6) 

where I c h = 1 — £ is the mutual information obtained from the 
channel. Note that the edge multiplicities are included in the 
above check and variable node computations. The a-posteriori 
mutual information I at a VN is found using 

dy 

1 = 1 £ II( 1 _ ^ = 1 _ C 1 - - hn,d v ) 

i=l 

where the second equality follows from The decoder is 
said to be successful when the a-posteriori mutual information 
at all the VNs of the protograph converges to 1 as the 
number of iterations of message passing goes to infinity. 
The BP threshold e* BP (B) of the ensemble described by the 
protograph with base matrix B is defined as the supremum of 
all erasure rates for which the decoder is successful. 

Example 3: The protograph B3.6 = (3 3) has a BP 
threshold of e* BP 0.4294. Note that all the CNs in the 
protograph are of degree 6 while all the VNs are of degree 
3. This BP threshold is expected because B3 6 corresponds to 
the (3, 6) regular LDPC block code ensemble. The following 
protograph B' 3 6 has a BP threshold e* BP w 0.4879 for L = 40. 
Note that, as before, all VNs are of degree 3 and all the CNs 
except the ones in the terminated portion of the code are of 



2 We will use the phrase "mutual information on an edge" to mean the mu- 
tual information between the message on the edge and the bit corresponding 
to the adjacent VN. 
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degree 6. 
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This is the C c (3,6) ensemble constructed in p8| . In terms of 
the notation introduced, this is given as Bo = Bi = E$2 = 
[1 1]; or equivalently as p%(x) — P2(x) — 1 + % + x 2 . □ 

The above example illustrates the strength of protographs — 
they allow us to choose structures within an ensemble defined 
by a pair of degree distributions that may perform better than 
the ensemble average. In fact, the BP performance of regular 
LDPC-CC ensembles has been related to the maximum-a- 
posteriori (MAP) decoder performance of the corresponding 
unstructured ensemble JTOl. 



2) WD: We now analyze the performance of the WD 



described in Section III-B in the limit of infinite blocklengths 
and the limit of infinite iterations of belief-propagation within 
each window. 

Remark: In the limit of infinite blocklength, each term in 
the base protograph B is replaced by a permutation matrix of 
infinite size to obtain the parity-check matrix, and therefore the 
latency of any window size is infinite, apparently defeating the 
purpose of WD. Our interest in the asymptotic performance, 
however, is justified as it allows us to establish lower bounds 
on the probability of failure of the windowed decoder to 
recover the symbols of the finite length code. In practice, it 
is to be expected that the gap between the performance of 
a finite length code with WD and the asymptotic ensemble 
performance of the ensemble to which the code belongs 
increases as the window size reduces due to the reduction 
in the blocklength of the subcode defined by the window. 

The asymptotic analysis for WD is very similar to that of the 
BP decoder owing to the fact that the part of the code within a 
window is itself a protograph-based code. However, the main 
distinction in this case is the definition of decoding success. In 
the case of BP decoding, the decoding is considered a success 
only when, for any symbol in the codeword, the probability 
of failing to recover the symbol goes to (or equivalently, 
the a-posteriori mutual information goes to 1) as the number 
of rounds of message-passing goes to infinity. On the other 
hand, the decoding within a window is successful as long 
as the probability of failing to recover the targeted symbols 
becomes smaller than a predecided small value 6. The decoder 
performance therefore depends on two parameters: the window 
size W and the target erasure probability 8. 

We define the threshold s*^(B,W,S) of the i th window 
configuration to be the supremum of the channel erasure rates 
for which the WD succeeds in retrieving the targeted symbols 



of the j th window with a probability at least (1 — 5), given 
that each of the targeted symbols corresponding to the first 
(£ — 1) window configurations is known with probability 1 — 6. 
Fig. ^illustrates the threshold s%. (B, W, 5) of the i th window 
configuration. The windowed threshold e*(B,W, 6) is then 




WJ'M 



m,K'M 



WK'M 



Fig. 2. Illustration of the threshold of the i th window configuration 
e*^(B, W,5). The targeted symbols of the previous window configurations 
are known with probability 1 — 5. The targeted symbols within the window are 
highlighted with a solid blue bar on top of the window. The symbols within 
the blue (hatched) region in the window are initially known with probability 
1 — e. The task of the decoder is to perform BP within this window until the 
erasure probability of the targeted symbols is smaller than <5. The window is 
then slid to the next configuration. 



defined as the supremum of channel erasure rates for which the 
windowed decoder can decode each symbol in the codeword 
with probability at least 1 — 5. 

We assume that between decoding time instants, no infor- 
mation apart from the targeted symbols is carried forward, i.e. 
when a particular window configuration has been decoded, 
all the present processing information apart from the decoded 
targeted symbols themselves is discarded. With this assump- 
tion, it is clear that the windowed threshold of a protograph- 
based LDPC-CC ensemble is given by the minimum of the 
thresholds of its window configurations. For the classical 
and modified constructions of LDPC-CC described in Section 



II-B all window configurations are similar except the ones 
at the terminated portion of the code. Since the window 
configurations at the terminated portions can only perform 
better, the windowed threshold is determined by the threshold 
of a window configuration not in the terminated portion of the 
code. Note that the performance of WD when the information 
from processing the previous window configurations is made 
use of in successive window configurations, e.g. when symbols 
other than the targeted symbols that were decoded previously 
are also retained, can only be better than what we obtain here. 

We now state a monotonicity property of the WD the proof 
of which is relegated to Appendix [I] 

Proposition 1 (Monotonicity of WD performance in W): 
For any C m ( J, K) ensemble B, 

e*{B,W,8) < e*(B,W + l,6). 



It follows immediately from the definition of the windowed 
threshold that 

e* (B, W, 5) < e* (B, W, 6') V 6 < 5'. 

Furthermore, from the continuity of the density evolution 
equations (|6]l and |5]), we have that when we set 8 — 0, we 



8 



decode not only the targeted symbols within the window but 
all the remaining symbols also. Since the symbols in the right 
end of the window are the "worst protected" ones within the 
window (in the sense that these are the symbols for which the 
least number of constraints are used to decode), we expect the 
windowed thresholds e* (B, W, 6 = 0) to be dictated mostly by 
the behavior of the submatrix Bo under BP. In the following, 
when the base matrix B of the protograph corresponding to 
an ensemble C is unambiguous, we will write e*(B, W, S) and 
e*(C, W, 5) interchangeably. 

We next turn to giving some properties of LDPC-CC 
ensembles with good performance under WD. We start with 
an example that illustrates the stark change in performance a 
small difference in the structure of the protograph can produce. 

Example 4: Consider WD with the ensemble C c (3,6) in 
Example [3] with a window of size W = 3. The corresponding 
protograph defining the first window configuration is 




and we have e*(C c (3,6),W = 3,6 = 0) = 0. This is seen 
readily by observing that there are VNs of degree 1 that are 
connected to the same CNs. In fact, from this reasoning, we 
see that e*(C c (J, K'J),W, 6 = Q) = QVJ<W<L. 

As an alternative, we consider the modified construction of 
to obtain the C m (J,K) ensemble B' given by 



II-B2 



Section 

B = [2 2]7Bi = [1 1]. This ensemble has a BP threshold 
e* BP (B') « 0.4875 for L = 40 which is quite close to that 
of the ensemble C c (3,6), e* BP {C c (3, 6)) « 0.4879. WD with 
a window of size 3 for this ensemble has the first window 
configuration 

2 2 
1 1 2 2 
112 2 

which has a threshold e*(B',W = 3,6 = 0) 0.3331, i.e. 
we can theoretically get close to 68.3% of the BP threshold 
with < 10% of the latency of the BP decoder. Note that 
this improvement in threshold has been obtained while also 
increasing the rate of the ensemble, since m s = 1 for the B' 
ensemble in comparison with m s = 2 for C c (3, 6). □ 

The above example illustrates the tremendous advantage 
obtained by using C m (J,K) ensembles for WD even under 
the severe requirement of 6 = 0. The following is a good 
rule of thumb for constructing LDPC-CC ensembles that have 
good performance with WD. 



Design Rule 1: For C m {J,K'J) 



JAj) 



> 2 for all j e [K'\ where dj 



ensembles, set 

: mindegfj^a;)). 



The above design rule says that for (J,K'J) ensembles, it 
is better to avoid degree-1 VNs within a window. Note that 
none of the C C (J, K'J) ensembles satisfy this design rule. We 
now illustrate the performance of LDPC-CC ensembles with 
WD when we allow 6 > 0. 



Example 5: We compare three LDPC-CC ensembles. The 
first is the classical LDPC-CC ensemble C\ = C c (3, 6). The 
second and the third are LDPC-CC ensembles constructed as 



described in Section II-B2 The ensemble C 2 is defined by the 
polynomials 

Px(x) = 2 + x 2 ,p 2 (x) = 2 + x 
and C3 is defined by 

qi(x) = q 2 (x) = 2 + x. 

We first observe that all three ensembles have the same 
asymptotic degree distribution, i.e. all are (3, 6) regular LDPC- 
CC ensembles when L->oa While C\ and C 2 have a memory 
m s = 2, C3 has a memory m s — 1. Therefore, for a fixed 
L, while Ci and C 2 have the same rate, C3 has a higher 
rate. Another consequence of a smaller m s is that C 3 can be 
decoded with a window of size W m i n {C^) = 2. Further note 
that whereas C 2 and C3 satisfy Design Rule [T] C\ does not. 
For a window of size 3, the subprotographs for ensembles C\ 
and C3 are as shown in Example [4j and that for ensemble C 2 
is as shown below 

2 2 
1 2 2 
10 12 2 

In Fig. [3] we show the windowed thresholds plotted against 
the window size for the three ensembles C\,C 2 and C 3 by 
fixing L — 100 for 6 € {10 -6 , 10~ 12 }. 
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Fig. 3. Windowed threshold as a function of the window size for the 
ensembles d,i = 1,2,3 with 5 e {lO" 6 , 1CT 12 }. The rates of the 
ensembles Ci and C2 are 0.49 whereas that of C3 is 0.495. The corresponding 
Shannon limits are therefore 0.51 for Ci and C2, and 0.505 for C3. 

A few observations are in order. The monotonicity of 
e* (B,W, 5) in W as proven in Proposition [T] is evident. The 
windowed thresholds e* (B, W, S) for C 2 and C3 are fairly close 
to the maximum windowed threshold even when W = W m i n . 
The windowed thresholds for ensembles C 2 and C3 are robust 
to changes in S, i.e. the thresholds are almost the same (the 
points overlap in the figure) for 6 — 10~ 6 and 6 = 10~ 12 . 
Further, the windowed thresholds e*(Ci, W, S) are fairly close 



9 



to the BP thresholds e* BP {d),i = 1,2,3 for W > 12. We will 
see next that this last observation is not always true. □ 

Effect of termination: The better BP performance of the 
C C (J,K) ensemble in comparison with that of the (J,K)- 
regular block code ensemble (cf. Example [3]) is because of 
the termination of the parity-check matrix of C C (J,K) codes. 
More precisely, the low-degree CNs at the terminated portion 
of the protograph are more robust to erasures and their erasure- 
correcting power is cascaded through the rest of the protograph 
to give a better threshold for the convolutional ensemble 
in comparison with that for the corresponding unstructured 
ensemble |16|. From the definition of the WD, we can see that 
the sub-protograph within a window does not have the lower- 
degree checks if previous targeted symbols are not decoded. 
Therefore, we would expect a deterioration in the performance. 
Furthermore, the Design Rule [T] increases the degrees of the 
CNs in the terminated portion. Therefore, the effect of different 
termination on the WD performance is of interest. 

Example 6: Tables [I] and [IT] illustrate the WD thresholds for 
C m (J,2J) ensembles that satisfy Design Rule [T] except when 
J = m s + 1. These ensembles are defined by the polynomials 

Px[x) =P2(x) = (J - m s ) + x + x 2 -\ hi" 1 '. 

Note that J > m s + 1. The ensembles are terminated so 
that the rate is R L = 0.49. The worst threshold with WD 
(corresponding to the least window size W m i n = m s + 1) 



is denoted e 



m s + l 



The largest threshold with WD is denoted 



and the BP threshold as e 



BP- 



The increase in the 



1,Rl 



TABLE I 

0.49, C m (J, 2 J), <5 : 



10~ 



J 


e m s + l 


£ L-m B 


£ BP 


2 


0.0008 


0.3162 


0.3342 


3 


0.4499 


0.4857 


0.4872 


4 


0.4449 


0.4469 


0.4961 


5 


0.3915 


0.3923 


0.4969 


6 


0.3469 


0.3475 


0.4959 


7 


0.3115 


0.3118 


0.4891 


8 


0.2829 


0.2832 


0.4785 


9 


0.2595 


0.2597 


0.4666 




TABLE II 




= 2,R L = 0.49, C m (J, 2J),<5 = 10~ 


J 


£ m s +l 


^ L — m s 


£ BP 


3 


0.0189 


0.4882 


0.4876 


1 


0.4875 


0.4947 


0.4958 


5 


0.4493 


0.4501 


0.4971 


6 


0.3941 


0.3945 


0.4972 


7 


0.3489 


0.3492 


0.4969 


8 


0.3131 


0.3133 


0.4967 


9 


0.2843 


0.2845 


0.4957 


10 


0.2607 


0.2608 


0.4937 



gap between e* L _ m and e* BP with increasing J illustrates the 
loss due to edge multiplicities ("weaker" termination). This is 
because the terminations at the beginning and at the end of 
the code are different, i.e. the CN degrees in the terminated 
portion at the beginning of the code are 2(J — m s ) which 
increases with J; whereas those at the end of the code are 2, 



a constant. Thus, much of the code performance is determined 
by the "stronger" (smaller check-degree) termination, the one 
at the end of the code for J > m s + 1. This is also seen by 
the fact that the gap between 4 +1 and s* L _ m decreases as 
J increases, meaning that the termination at the beginning of 
the code is weak and increasing the window size helps little. 
Note that the C m (3, 6) ensemble in Table [IT] is in fact the 
C c (3, 6) classical ensemble, and that £* L _ m is larger than the 
corresponding BP threshold. This is possible since WD only 
demands that the erasure probability of the targeted symbols 
is reduced to 8. In contrast, BP demands that the erasure 
probability of all the symbols is reduced to 0. □ 

From the above discussion, we can add the following as 
another design rule. 



Design Rule 2: For C m (J, K) ensembles, keep the ter- 
mination at the beginning of the code strong, preferably 
stronger than the one at the end of the code. That is, use 
polynomials V = {pj(x),j £ [K')} such that each of the 
sums 



K' 

Hp. 



(0) 
3 1 



K 



(y-i) 



is kept as small as possible. 



Targeted symbols: We have thus far considered only the 
first K' VNs in the sub-protograph contained within the 
window to be the targeted symbols. However, as an alternative 
way to trade-off performance for reduced latency, it is possible 
to consider other VNs also as targeted symbols. In this 
case, the window would be shifted beyond all the targeted 
symbols after processing each window configuration. For a 
window of size W, let us denote by e* (B, W, 5) the windowed 
threshold when the targeted symbols are the first iK' VNs, 

1 < i < W. Hence, e* (B, W, 8) = (B, W, 8). By definition, 
e*(B,W,8) <e*(B,W,6). 

Example 7: Consider the C m (6, 12) ensemble with m s = 1 
defined by pi(x) — P2{x) = 3 + 3x, denoted C4; and the 
C m (4,8) ensemble with m s = 1 defined by qi(x) — q2(x) = 

2 + 2x, denoted C5. Also consider ensembles Cq and C 7 given 
by ri(x) — r 2 {x) = 2 + Ax and Si(x) = s 2 {x) = 2 + 2x + 
2x 2 respectively. Both Ce and C7 are C TO (6, 12) ensembles, 



but with memory m s = 1 and 2 respectively. Table III gives 
the windowed thresholds e*(Cj,W — 4, 8) with iK' targeted 
symbols for a window of size 4 for j — 4, 5, 6, 7. 

TABLE III 

Windowed thresholds e*(Cj, W = 4, 8 = 10 -12 ), j = 4, 5, 6, 7 



i 


C 4 


C 5 


c 6 


C 7 


1 


0.4429 


0.4912 


0.4835 


0.4924 


2 


0.4429 


0.4905 


0.4835 


0.4919 


3 


0.4427 


0.4824 


0.4828 


0.4824 


4 


0.4294 


0.3331 


0.3331 


0.3331 



One might expect the windowed threshold e*(H,W, 8) 
to be higher for an ensemble for which e^(B, W 7 , <5) is 
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higher. This is not quite right: e|(C 4 , 4, lCT 12 ) 0.4294 > 
0.3331 re e*(C 5 ,4,10- 12 ) whereas e* (C 5 , 4, 10~ 12 ) > 
e*(C4, 4, 10~ 12 ) V i < 4. This can again be explained as the 
effect of stronger termination in C5 in comparison with C4. 
This is also evident in the larger thresholds for the (6, 12) 
ensemble Cq with same memory as C4, but stronger termina- 
tion. Also, keeping the same termination and increasing the 
memory improves the performance, as is exemplified by the 
larger thresholds of C7 in comparison with those of C5. □ 

The windowed thresholds e*(B, W, 5) quantify the unequal 
erasure protection of different VNs in the sub-protograph 
within the window. Furthermore, it is clear that for good per- 
formance, it is advantageous to keep fewer targeted symbols 
within a window. 

B. Finite length performance evaluation 

The finite length performance of LDPC codes under iterative 
message-passing decoding over the BEC is dependent on the 
number and the size of stopping sets present in the parity- 
check matrix of the code (23), |25|. Thus, the performance 



of the codes varies based on the parity-check matrix used 
to represent the code and, consequently, the performance 
of iterative decoding can be made to approach that of ML 
decoding by adding redundant rows to the parity-check matrix 
(See e.g. [26]). However, since we are exploiting the structure 
of the parity-check matrix of the convolutional code, we 
will not be interested in changing the parity-check matrix 
by adding redundant rows as this destroys the convolutional 
structure. The ensemble stopping set size distribution for some 



protograph-based LDPC codes was evaluated in |27| where 
it was shown that a minimum stopping set size that grows 
linearly in blocklength is important for the good performance 
of codes with short blocklengths. This analysis is similar to the 
analysis of the minimum distance growth rate of LDPC-CC 
ensembles — see (28] and references therein. It is worthwhile 
to note that although the minimum stopping set size grows 
linearly for protograph codes expanded using random permu- 
tation matrices, the same is not true for codes expanded using 
circulant permutation matrices (29). In the following we will 
evaluate the finite length performance of codes constructed 
from C m (J,K) ensembles with BP and WD through Monte 
Carlo simulations. WD was considered with only the first 
K'M symbols as the targeted symbols. 

In Figs. |4] and [5] the symbol error rate (SER) and the 
codeword error rate (CER) performance are depicted for codes 
C\ E C\ and C2 G C2, where the ensembles C\ and C2 were 
defined in Example [5] The codes used were those constructed 
by Liva fT2) by expanding the protographs using circulant 
matrices (and sums of circulant matrices) and techniques of 
progressive edge growth (PEG) 1 30 1 and approximate cycle 
extrinsic message degree (ACE) [31 1 to avoid small cycles in 
the Tanner graphs of the codes. The girth of both the codes 
C\ and C2 was 12. The parameters used for the construction 
were L = 20 and M — 512 so that the blocklength 
n = LK'M = 20480 and R L = 0.45. The BP thresholds 
for ensembles C\ and C2 with L = 20 were 0.4883 and 
0.4882 respectively. As is clear from Figs. [4] and [5] code C2 
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Fig. 4. SER performance for BP and Windowed Decoding over BEC. 
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Fig. 5. CER performance for BP and Windowed Decoding over BEC. Also 
shown is the (Singleton) lower bound Psb as SB. 



outperforms code C\ for small window sizes (W = 3, 5), 
confirming the effectiveness of the proposed design rules for 
windowed decoding. For larger window sizes (W = 10), 
there is no marked difference in the performance of the two 
codes. It was also observed that for small M values (< 128), 
the performance of codes constructed through circulant per- 
mutation matrices was better than those constructed through 
random permutation matrices. This difference in performance 
diminished for larger M values. 

We include in Fig. [5] for comparison, a lower bound on 
the CER P cw . The Singleton bound, Psb, represents the 
performance achievable by an idealized (n, k) binary MDS 
code. This bound for the BEC can be expressed as 



> 



E 

-n-k+l 



e j (l-s) n - 



SB- 



Note that by the idealized (n, k) binary MDS code, we mean 
a binary linear code that achieves the Singleton bound d m i n < 
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n — k + 1 with equality. This code does not exist for all values 
of k and n. 

V. Erasure Channels with Memory 

We now consider the performance of LDPC-CC ensembles 
and codes over erasure channels with memory. We consider 
the familiar two-state Gilbert-Elliott channel (GEC) (32), (33) 
as a model of an erasure channel with memory. In this model, 
the channel is either in a "good" state G, where we assume the 
erasure probability is 0, or in an "erasure" state E, in which 
the erasure probability is 1. The state process of the channel is 
a first-order Markov process with the transition probabilities 
V{E ^G}=g and P{G -> E} = b. With these parameters, 
we can easily deduce (34) that the average erasure rate e and 
the average burst length A are given by 



¥{E} 



b + g 



We will consider the GEC to be parameterized by the pair 
(e, A). Note that there is a one-to-one correspondence between 
the two pairs (b,g) and (e, A). 

Discussion: The channel capacity of a correlated binary 
erasure channel with an average erasure rate of e is given as 
(1 — e), which is the same as that of the memory less channel, 
provided the channel is ergodic. Therefore, one can obtain 
good performance on a correlated erasure channel through the 
use of a capacity-achieving code for the memoryless channel 
with an interleaver to randomize the erasures (27), (35). This is 
equivalent to permuting the columns of the parity-check matrix 
of the original code. We are not interested in this approach 
since such permutations destroy the convolutional structure of 
the code and as a result, we are unable to use the WD for 
such a scheme. 

Construction of LDPC block codes for bursty erasure chan- 
nels has been well studied. The performance metric of a 
code over a bursty erasure channel is related to the maximum 
resolvable erasure burst length (MBL) denoted A max (35), 
which, as the name suggests, is the maximal length of a single 
solid erasure burst that can be decoded by a BP decoder. 
Methods of optimizing codes for such channels therefore focus 
on permuting columns of parity-check matrices to maximize 
A„ 



e.g. [36|-|41|. Instead of permuting columns of the 
parity-check matrix, in order to maintain the convolutional 
structure of the code, we will consider designing C m (J,K) 
ensembles that maximize A max . 



A. Asymptotic Analysis 

1 ) BP: As noted earlier, the performance of LDPC-CC en- 
sembles depends on stopping sets. The structure of protographs 
imposes constraints on the code that limit the stopping set sizes 
and locations, as will be shown shortly. 

Let us define a protograph stopping set to be a subset 
5(B) of the VNs of the protograph B whose neighboring 
CNs are connected at least twice to 5(B). These are also 
denoted as S(V), in terms of the set of polynomials defining 
the protograph. We define the size of the stopping set as 



the cardinality of 5(B), denoted |5(B)|. We call the least 
number of consecutive columns of B that contain the stopping 
set 5(B) the span of the stopping set, denoted (5(B)). Let 
us denote the size of the smallest protograph stopping set 
of the protograph B by |5(B)|*, and the minimum number 
of consecutive columns of the protograph B that contain a 
protograph stopping set by (5(B))*. When the protograph 
under consideration is clear from the context, we will drop it 
from the notation and use |5|* and (5)*. The minimum span 
of a stopping set is of interest because we can give simple 
bounds for A max based on (5(B))*. Note that the stopping 
set of minimal size and the stopping set of minimal span are 
not necessarily the same set of VNs. However, we always have 

|5(B)|* < (5(B))*. 

The following example clarifies the notation. 

Example 8: Let us denote the base matrix corresponding to 
the protograph of the ensembles Cj of Example [5] as B^ 1 ',! = 
1,2,3. For ensembles C\ and C3, the first two columns of 
B^,i = 1,3 form a protograph stopping set, i.e. 5(B^) = 
{Vi,V2j-,i = 1,3 is a stopping set. This is clear from the 
highlighted columns below 



B (D 



/ 1 1 
11110 

111111 

1111 

000011 



B (3) 



V : : : : : : 

/ 2 2 

1 1 2 2 

112 2 

1 1 

V ! : : : : \ 



\ 



J 



Therefore, |5(B«)|* < 2 and (5(B«))* < 2. Since no 
single column forms a protograph stopping set, |5(BW)|* > 2 
and (5(BW))* > 2, implying |S(B«)|* = (5(B«))* = 
2,i= 1,3. 

For ensemble C%, the highlighted columns of B^ 2 ) in 
the following matrix form a protograph stopping set, i.e. 
5(B< 2 )) = {Vi, 14} is a stopping set. 



B (2) 



( 2 2 
1 2 2 
10 12 2 
1 1 
O00O10 



V 



/ 



Thus, |5(B( 2 ))|* < 2 and (5(B< 2 )))* < 4. As no single 
column of B( 2 ) is a protograph stopping set and no three 
consecutive columns of B^ 2 ' contain a protograph stopping 
set, it is clear that |5(B( 2 ))|* > 2 and (5(B< 2 )))* > 4, so 
that 

2 = |5(B( 2 >)|* < (5(B^))* =4. 
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' K'{m s {hM) - 1) + {h - h + 1), i(h,h) = k 2 < ihJh ^ 3h = j(h,h) 

K'(m s (h,l 2 ) - 1) + 1) i(h,h) = k 2 < kiJh < 3h = j(h,h) 

K'(m s (h,l 2 ) - 1) + 1, i(h,h) = k t < k 2 ,3i 2 < 3h = 3'(h,h) 

^K'{m s {h,h) - 1) - (h -h - 1), i(h,h) = ki < k 2 ,3h < 3i 2 = j(h,h)- 



(7) 



In these cases, it so happened that the stopping set with the 
minimal size and the stopping set with the minimal span were 
the same. □ 

Our aim in the following will be to obtain bounds for the 
maximal (5(B))* over C m ( J, K) ensembles with memory m s , 
which we denote (S(J, K,m s ))* , and design protographs that 
achieve minimal spans close to this optimal value. 

The analysis of the minimal span of stopping sets for un- 
structured LDPC ensembles was performed in [42 1. However, 
the structure of the protograph-based LDPC-CC allows us to 
obtain (S(J, K,m s ))* much more easily for some C m (J,K) 
ensembles. 

We start by observing that if one of the VNs in the 
protograph is connected multiple times to all its neighboring 
CNs, then it forms a protograph stopping set by itself. In order 
to obtain a larger minimum span of stopping sets, it is desirable 
to avoid this case, and we include this as one of our design 
criteria. 



Design Rule 3: For a C m (J,K) ensemble, choose the 
polynomials Pj(x) such that for every j £ \K'], there 
exists < ij < (m, + 1) J' — 1 such that pj j) = 1. 



Using the polynomial representation of LDPC-CC ensem- 
bles is helpful in this case since we can easily track stopping 
sets as those subsets that have polynomials whose coefficients 
are all larger than 1. From this fact, we can prove the 
following. 

Proposition 2 ((S)* for C m (J,2J) protographs): For 
C m (J, 2 J) protographs of memory m s defined by polynomials 
P\[x) and p 2 (x), (S)* can be upper bounded as 



(sy< 



2m s , = i2<ii,h<3i = m s 

2m 8 - 1, = i 2 < < j 2 = m s 

2m s - 1, = ii < i 2 , ]2 < ji = m s 

2m s ~2, = ii < i 2 , j x < j 2 = m s 

where ii — min deg(pi(x)) and ji = deg(pi(x)),l = 1,2. ■ 

We give the proof in Appendix [TTJ We see from the above 
that (S(J,2J,m s ))* < 2m s and a necessary condition for 
achieving this span is the first of four possible cases listed 
above, which we include as another design criterion. 



Design Rule 4: For C m (J,2J) ensembles with mem- 
ory m s , set 

mmdeg{p 2 (x)) = and deg(pi(x)) = m s . 



Corollary 3 (Optimal C m (J,2J) protographs): For 
C m (J,2J) protographs with memory m s and J > 

(S{J,2J,m s ))* = 2m s . 



2, 



The proof is given in Appendix [III] Note that ensemble C 2 
in Example [5] achieves (S(J,2J,m s ))* , as was observed in 
Example [8] It also satisfies design rules [T[ [3] and |4] We bring 
to the reader's attention here that constructions other than the 
one given in the proof of the above corollary that achieve 
(£)* = 2m s are also possible. These constructions allow us 
to design C m (J,2J) ensembles for a wide range of required 
(S)* . We quickly see that a drawback of the convolutional 
structure is that if m s is increased to obtain a larger (S)*, the 
code rate Rl decreases linearly for a fixed L. 

We give without proof the following upper bound for (S)* 
for C m (J,K' J) ensembles, as it follows from Proposition [2] 

Proposition 4 ( (S)* for C m (J, K' J) protographs): 
For C m (J,K'J) protographs defined by polynomials 

V = {pj(x),j £ [K 1 ]}, we have 



(sy 



< min 



{(S h ,i 2 )} 



where (Si lt i 2 ) is the upper bound for the minimal span (S^^^) 
of stopping sets Si lt i 2 confined within subsets of the form 
r hM {x) = a 1 (x)p h (x) + a 2 (x)p h (x) given in Equation 0, 
where we have used the notation ii u = min deg(pi u (x)), ji = 
deg(pi u (x)), u = 1,2, i(h,l 2 ) = mm{i h ,ii 2 }, j(h,l 2 ) = 
max{j h ,j l2 } and m s (h,l 2 ) = j{h,h) - i(h,l 2 ). ■ 

Discussion: By looking at the stopping sets confined within 
columns corresponding to two polynomials only, we can use 
Proposition [2] to upper bound the span of these stopping sets. 
The minimal such span over all possible choices of the two 
columns therefore gives an upper bound on the minimal span 
of the (J,K'J) protograph. Since (Si lt i 2 ) < K'm s V h,l 2 
from Equation Q, we have (S(J, K'J, m s ))* < K'm s , which 
is similar to the result in Proposition|2] This bound is, however, 
loose in general. 

For terminated codes, we can give an upper bound for (S)* 
that is tighter in some cases. 

Corollary 5 ((S)* for C m (J, K) protographs): For 
C m (J,K) protographs terminated after L instants, 
(S)*<K'L. 

Proof: From the Singleton bound for the protograph, we 
have (S)* < J'(L + m s ). Since we need m s < y~^L for a 
positive code rate in |2}, (S)* < = K'L. 

Note that for C m (J,K'J) protographs, this is tighter than 
the bound (S)* < K'm s < K'(K' - 1)L, which, in the worst 
case, is a factor (K' — 1) times larger. However, since we are 
interested mainly in ensembles for which m s <C L, this bound 
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might be looser than the one in Proposition [4] for C m (J, K'J) 
ensembles. ■ 

Example 9: Consider the C m ( J, K'J) ensemble with mem- 
ory m s = u(K' - 1) + l,m s < (K' - l)L defined by the 
polynomials 

p l (x) = (J-l)+x>>,le [K'l 

jl = m s — u(l — 1). It can be shown by an argument similar 
to the one used to prove Corollary [3] that for the protograph 
of this ensemble, (5)* = K'u + 2. This is exactly the bound 
in Proposition [4] since 



mm{(S luh )} = {Sk>-i,k>) = K'u + 2. 

h <h 



Thus, in this case 



(sy = 



K" 



1 



(m s -l) + 2 = 



K' 



K' 



1 



which is roughly only a fraction of the (loose) upper bound for 
(S(J, K' J,m s ))* suggested in the discussion of Proposition 
[4] The constructed C m (J,K'J) protographs are thus optimal 
in the sense of maximizing the minimal span of stopping sets, 
i.e. 

(S (J, K' J, u(K' -1) + 1))* =K'u + 2 Vwe [L-l]. 

They also satisfy Design Rules [T] and [3] for J > 2. Although 
Proposition |4] gave a tight bound for (5) * in this case, it is 
loose in general. □ 

We can show that the C m (J, K) protographs have minimal 
spans at least as large as the corresponding spans of C m (a, K) 
protographs. 

Proposition 6: (S(J,K,m s ))* > (S(a,K,m s ))* where 
a = gcd(J. K) > 2. 

Proof: The equality is trivial when a = J. When 
2 < a < J = aJ', one way of constructing the C m (J,K) 
ensembles with memory m s is to let each set of modulo 
polynomials Vi themselves define C m (a,K) ensembles with 
memory m s . The result then follows by noting that a stopping 
set for the polynomials V has to be a stopping set for every 
set of polynomials "Pi, I = 0, 1, • • • , J' — 1. ■ 

The construction proposed above often allows us to strictly 
increase the minimal span of the C m (J,K) ensemble in 
comparison with the C m (a,K) ensemble, as illustrated by the 
following example. 

Example 10: Consider the construction of a C m (4,6) en- 
semble with memory 3. Let us call it Cg. The different 
parameters in this case are J = 4, K = 6, a — 2, J' = 2, 
K' = 3 and m s = 3. Since m s — u(K' — 1) + 1 with 
u = 1, we have for C m (2,6) protographs, (5(2,6,3)}* = 5 
from Example [9] and we will define the modulo polynomials 
To to be the optimal construction that achieves this minimal 



span, i.e. Vq = {1 



1 + x 2 , 1 + x}. Then, by defining 



V\ = {l + x 3 ,l + x 3 , 1 + .t 3 }, we can show that (S(V))* = 6 
and hence 

(5(4,6,3))* > 6 > 5 = (5(2,6,3))*. 



Note that the protograph defined by V has no degree- 1 
VNs associated with the component matrix Bo. In fact, the 
constructed C m (4, 6) ensemble has e*(Cg,m s + 1, 10 -12 ) « 
0.6469, fairly close to the Shannon limit of e sh — |, even with 
the smallest possible window size. Table IV lists the windowed 



TABLE IV 

e*(C 8 ,m s + l,10- 12 ) 



1 

0.6469 



2 

0.6184 



3 

0.5803 



4 

0.4997 



thresholds of this ensemble with different numbers of targeted 
symbols within the smallest window for S = 10~ 12 . □ 
2) WD: The asymptotic analysis for WD is essentially 
the same as that for BP. We will consider WD with only 
the first K' symbols within each window as the targeted 
symbols. We are now interested in the sub-protograph stopping 
sets, denoted 5(B, W), that include one or more of the 
targeted symbols within a window. Let us denote the minimal 
span of such stopping sets as (S(B,W))* . Since stopping 
sets of the protograph of the LDPC-CC are also stopping 
sets of the sub-protograph within a window, and since such 
stopping sets can be chosen to include some targeted symbols 
within the window, we have (S(B,W))* < (5(B))*. In fact, 
(S(B,W))* = (5(B))* when 



W > 



(sy 



K' 



since in this case the first K' 



(sy 

K' 



columns are completely 



contained in the window. Further, we have 

(S(B,W))* < (S(B,W + 1))*. 

This is true because a stopping set for window size W 
involving targeted symbols is not necessarily a stopping set 
for window size W + 1, whereas a stopping set for window 
size W + 1 is definitely a stopping set for window size W. 

Remark: When the first iK' symbols within a window are 
the targeted symbols, we have for i < W — m s 

(5 4 (B,W0)* = {S{B,W-i + l)Y 

where (5^(B, W))* denotes the minimal span of stopping sets 
of the sub-protograph within the window of size W involving 
at least one of the iK' targeted symbols, and (5i(B, W))* = 
(5(B, W))*. Consequently, we have 

(Si(B,W))* < (S(B,W))*. 

The definition of (Si(B,W))* can be extended to accommo- 
date W — m s + 1 < i < W, as in the case of windowed thresh- 
olds. In particular, we have (S W (B, W))* = (5(B ))* < J', 
where the last inequality is from the Singleton bound. 

Example 11: Consider the ensemble C2 defined in Example 
[5] With a window of size W = m s + 1 — 3, we have 
(5(C2,3))* = 2 with the corresponding stopping set 53 = 
{^2, V3} highlighted below 
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and with a window size W = 4, we have (5(C 2 ,4))* = 
(5(C 2 ))* = 4, and the corresponding stopping sets S4 = 
{V x ,Vi} and = {Vi,V 2 , V 4 } are as follows 



/ 


2 


2 

















\ 
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2 
















1 
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2 


2 








V 








1 
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2 


2 / 
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2 
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2 
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2 








V 








1 








1 


2 


2 / 



Note that for window size 3, whereas the minimal span of 
a stopping set involving VN V 2 is 2, that of a stopping set 
involving V\ is 4. However, for window size 4, the stopping 
set involving V\ with minimal span, denoted S4, and that 
involving V 2 , 5^, each have a span of 4, although their 
cardinalities are 2 and 3 respectively. We have in this case, 
5 4 C Notice that (5 2 (C 2 ,4))* = (5(C 2 , 3))* =2. □ 



B. Finite length analysis 

1 ) BP: We now show the relation between the parameters 
A max and (5(B))*. We shall assume in the following that 
(5)* > 2, i.e. every column of the protograph has at least one 
of the entries equal to 1. We will consider the expansion of 
the protographs by a factor M to obtain codes. 

Proposition 7: For any (J, K) regular LDPC-CC, A max < 
M(S)* - 1. 

Proof: Clearly, the set of the M(S)* columns of the 
parity-check matrix corresponding to the (5)* consecutive 
columns of B that contain the protograph stopping set with 
minimal span must contain a stopping set of the parity- 
check matrix. Therefore, if all symbols corresponding to these 
columns are erased, they cannot be retrieved. ■ 

Corollary 8: A terminated C m {J,K'J) LDPC-CC with 
m s = u(K' — 1) + l,u € [L — 1] can never achieve the 
MBL of an MDS code. 

Proof: From the Singleton bound, we have A max < n — 
k = (L + m s )M, assuming that the parity-check matrix is 
full-rank. From Proposition [JJ we have, 

A max < M(S(J,K'J,u(K' - 1) + 1))* 
K' 



1 



K' - 1 



M - 1 



where the second equality follows from the discussion in 
Example^ Since we require m s < — {K' — 1)L for a 

non-negative code rate in Q, 



A, 



< 



(K' - l)m s + (K' - l)L 



K> 
m s )M 



M- 1 



which shows that the MBL of an MDS code can never be 
achieved. ■ 

Remark: Although the idealized binary (n, k) MDS code 
does not exist, there are codes that achieve MDS performance 



when used over a channel that introduces a single burst of 
erasures in a codeword. For example, the (2n, n) code with a 
parity-check matrix H = [I„ I„] has an MBL of A max = n. 

Despite the discouraging result from Corollary [8] we can 
guarantee an MBL that linearly increases with (5)* as follows. 

Proposition 9: For any (J, K) regular LDPC-CC, A max > 
M((S)* - 2) + 1. 

Proof: From the definition of (5)*, it is clear that if one of 
the two extreme columns is completely known, all other sym- 
bols can be recovered, for otherwise the remaining columns 
within the span of the stopping set 5 will have to contain 
another protograph stopping set, violating the minimality of 
the stopping set span (5)* (The two extreme columns are 
pivots of the stopping set |4T).) The largest solid burst that 
is guaranteed to have at least one of the extreme columns 
completely known is of length M((5)* — 2) + 1. Therefore, 
A max > M((5)* - 2) + 1. ■ 

Example 12: For the C m (J,K'J) ensemble with memory 
m s = u{K' — 1) + 1, u € [L — 1] in Example [9] we have 

'to, — l\ . , / m, — 1 



MK' 



K' - 1 



1 < A nin.r < MK' 



K> - 1 



-2M-1 



from Propositions [7] and [9] Thus, we can construct codes with 
MBL proportional to m s . □ 

2) WD: The MBL for WD A max (W) can be bounded as 
in the case of BP based on (5(B, W))*. Assuming that the 
window size is W > m s + 1, the targeted symbols are the first 
K' symbols within the window, and the polynomials defining 
the ensemble are chosen to satisfy Design Rule [3] we have 
(5(B, W))* > 2. Propositions UJ and |9] in this case imply that 

M((5(B, W))* - 2) + 1 < A rnax (W) < M(5(B, W))* - 1. 

C. Numerical results 

The MBL for codes C\ and C 2 (the same codes used 



in Section IV-B 1 was computed using an exhaustive search 
algorithm, by feeding the decoder with a solid burst of erasures 
and testing all the possible locations of the burst. The MBL 
for the codes we considered was 1023 and 1751 for codes C\ 
and C2, respectively. Note that for code C\, the MBL A max = 
1023 = 2M — 1, i.e., code C\ achieves the upper bound 
from Proposition [JJ More importantly, the maximum possible 
A max was achievable while maintaining good performance 
over the BEC with the BP decoder. However, the MBL for 
code C 2 , A rnax = 1751 < 2047 = AM - 1, is much smaller 
than the corresponding bound from Proposition [JJ In this case, 
although other code constructions with A max up to 2045 were 
possible, a trade-off between the BEC performance and MBL 
was observed, i.e. the code that achieved A max — 2045 was 
found to be much worse over the BEC than both codes C\ and 
C 2 considered here. Such a trade-off has also been observed 
by others, e.g. |40|. This could be because the codes that 
achieve large A max are often those that have a very regular 
structure in their parity-check matrices. Nevertheless, our code 
design does give a large increase in MBL (> 70%) when 
compared with the corresponding codes constructed from C c 
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ensembles, without any decrease in code rate (same m s ). The 
MBL achieved as a fraction of the maximum possible MBL 
^max/in — k) was roughly 9.1% and 15.5% for codes C\ 
and C2, respectively. 

In Figs |6| [7] and [8] we show the CER performance obtained 
for codes C\ and Ci over GEC channels with A = 10, 50 
and 100 respectively, and e G [0.1, 0.6]. As can be seen from 




Fig. 6. CER Performance on GEC with A = 10 with Singleton bound (SB). 




Fig. 7. CER Performance on GEC with A = 50 with Singleton bound (SB). 

the figures, for W = 3, code C2 always outperforms code 
C\, while for W — 5 there is no such gain when A = 100. 
However, for W — 10 and for BP decoding, code C\ slightly 
outperforms C2. 

Note that the code C2 outperforms C\ for small e when the 
average burst length A = 100 for large window sizes and for 
BP decoding. This can be explained because in this regime, 
the probability of a burst is small but the average burst length 
is large. Therefore, when a burst occurs, it is likely to resemble 
a single burst in a codeword, and in this case we know that 
the code C2 is stronger than C\. Also note the significant 
gap between the BP decoder performance and the Singleton 




-A- £7, RP 

-a-C 2 RP 
- - -SR 



i 1 / 1 1 

0.1 0.2 0.3 0.4 0.5 0.6 

Fig. 8. CER Performance on GEC with A = 100 with Singleton bound 
(SB). 

bound, suggesting that unlike some moderate length LDPC 
block codes with ML decoding [38 1, LDPC-CC are far from 
achieving MDS performance with BP or windowed decoding. 

VI. Conclusions 

We studied the performance of a windowed decoding 
scheme for LDPC convolutional codes over erasure channels. 
We showed that this scheme, when used to decode terminated 
LDPC-CC, provides an efficient way to trade-off decoding 
performance for reduced latency. Through asymptotic perfor- 
mance analysis, several design rules were suggested to avoid 
bad structures within protographs and, in turn, to ensure good 
thresholds. For erasure channels with memory, the asymptotic 
performance analysis led to design rules for protographs that 
ensure large stopping set spans. Examples of LDPC-CC en- 
sembles that satisfy design rules for the BEC as well as erasure 
channels with memory were provided. Finite length codes 
belonging to the constructed ensembles were simulated and the 
validity of the design rules as markers of good performance 
was verified. The windowed decoding scheme can be used to 
decode LDPC-CC over other channels that introduce errors 
and erasures, although in this case error propagation due to 
wrong decoding within a window will have to be carefully 
dealt with. 

For erasure channels, while close-to-optimal performance 
(in the sense of approaching capacity) was achievable for the 
BEC, we showed that the structure of LDPC-CC imposed con- 
straints that bounded the performance over erasure channels 
with memory strictly away from the optimal performance (in 
the sense of approaching MDS performance). Nevertheless, the 
simple structure and good performance of these codes, as well 
as the latency flexibility and low complexity of the decoding 
algorithm, are attractive characteristics for practical systems. 
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Appendix I 
Proof of Proposition^ 

Consider the i th window configuration for window sizes W 
and W + 1 shown in Fig. [9] We are interested in a window 
configuration that is not at the terminated portion of the code. 
Call the Tanner graphs of these windows A = (Va,Ca, Ea) 




Fig. 9. Sub-pro tographs of window sizes W and W+l. The edges connected 
to targeted symbols from previous window configurations are shown in darker 
shade of gray. 

and B = (Vb,Cb, E b ) respectively, where Va,Vb and 
Ca, Gb are the sets of VNs and CNs respectively and Ea, Eb 
are the sets of edges. Clearly, Va C Vb,Ca C Cb, and 
Ea C Eb- Any VN in Va that is connected to some variable 
in Vb \ Va has to be connected via some CN in Cb \ Ca- The 
edges between these CNs and VNs in Va are shown hatched 
in Fig. [9] Consider the computation trees for the a-posteriori 
message at a targeted symbol in Va and that for the same 
symbol in Vb- Call them Ta and Tb respectively. Then we 
have Ta C Tb- 

We now state two lemmas which will be made use of 
subsequently. The proofs of these lemmas are straightforward 
and have been omitted. 

Lemma 10 (Monotonicity of C): The CN operation in |5]) is 
monotonic in its arguments, i.e., 

< x < x < 1 =*> C(x,y) < C(x',y) V y € [0,1], 

where the two-argument function C(x, y) = xy. ■ 

Lemma 11 (Monotonicity of Vj: The VN operation in (|6]l is 
monotonic in its arguments, i.e., 

< x < x' < 1 V(x, y) < VfV, y) V y € [0, 1], 

where V(x,y) = 1 — (1 — a;)(l — y). ■ 

The operational significance of the above lemmas is the fol- 
lowing: if we can upper (lower) bound the mutual information 
on some incoming edge of a CN or a VN, and use the bound 
to compute the outgoing mutual information from that node, 
we get an upper (lower) bound on the actual outgoing mutual 
information. Thus, by bounding the mutual information on 
some edges of a computation tree and repetitively applying 



Lemmas [T0| and [TTj one can obtain bounds for the a-posteriori 
mutual information at the root of the tree. 

We start by augmenting Ta, creating another computation 
tree Ta that has the same structure as Tb- In particular, Ta 
includes the additional edges corresponding to the hatched 
region. In Ta and Tb, we denote the set of these edges 
by E u (Ta) and E u (Tb) respectively. In 7~a > we assign zero 
mutual information to each edge in E u {1~a)- 

Now, let I Ta , I t a and I Tb be the a-posteriori mutual infor- 
mation at the roots of the trees Ta, Ta and Tb respectively. 
Then it is clear that I 7 * = I t a , since the messages on 
edges in E u (Ta) are effectively erasures and zero out the 
contributions from the checks in C\ \ Ca = Cb \ Ca- 

On the other hand, if we denote by I e (Tb ) the mutual infor- 
mation associated with an edge e e E u (Tb), and by l e (J~£) 
the mutual information associated with the corresponding edge 
in T£, we know that I e (T£) = so that 4(7^) < I e (T B ) 



Hence, we have from Lemmas [lOj and [TT| that I^a < F~b 
Since I Ta = I t a , it follows that I Ta < I Tb , as desired. 

Appendix II 
Proof of Proposition^ 

From the definitions made in the statement of Proposition [2] 
we have < ii < ji < m s , I = 1,2. We assume ii < ji in 
order to satisfy Design Rule|3] Since the code has memory m s , 
we have i = min{ii,i2} = and j = max{ji, j'2} = ^s- 
Consider the subset of columns of B corresponding to the 
polynomial r(x) = pi(x)bi(x) + P2{x)b2(x) where 



h(x) 



h = h ~ 1 
x h ~\ »a<ja-l 



and 



b 2 (x) 



x 11 , 



h = jl ~ 1 
X^-\ h<j!-l. 



We claim that this is a protograph stopping set. To see this, 
consider the columns corresponding to the above subset with 
(3(pi(x)) and (3(p2(x)) as the column polynomials defining 
B. We have 

f(x) = 0(pi(aO)t>i(a:) + (3{p 2 (x))b 2 (x) 

= (x 11 + x n ){x i2 + x l2+1 + ■■■+ x^- 1 ) 

+ {x i2 + x h )(x 11 + x il+1 + ■■■ + x jl - 1 ) 



rj-il+i2 _|_ rj.il+12 + l 



X 



jl+32-1 



r ii+*2-i 



+ x jl+l2 +x jl+l2+1 + ■ 
+ x il+i ' + x ll+l2+1 + ■ 
+ x ll+]2 + x ll+]2+l + • 
2x il+l2 +■■■ + 2x il+32 - 1 

+ 2x ll+j2 + ■■■ + 2x jl+12 - 1 



when ji > %i + 1, 1 = 1, 2. Similarly, it can be verified that 
f{x) has all coefficients equal to 2 in all other cases also. 
Clearly, f(x) < r(x) and thus r(x) can only differ from 
f(x) in having larger coefficients. Therefore, r(x) also has 
all coefficients greater than 1. This shows that the chosen 
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subset of columns form a protograph stopping set. Based on 
the parameters ii,ji,l = 1,2, we can count the number of 
columns included in the span of this stopping set and therefore 
give upper bounds on (S)* as claimed : 



(S)* < 



'2(ji 


- h) 







= h < h,h < h = ™s 


( 2(j 2 


-ia) 


-1, 





= «2 < «l,jl < h = TO s 


' 2(j! 


-ii) 


-1, 





= k < h-,h < ji = m s 


Xh 


- ii - 


-1), 





= H < «2, jl < J2 = ™s 



Appendix III 
Proof of Corollary[3] 

Consider the protograph of the ensemble given by pi(x) — 
( J — 1) + x m " and P2(x) = (J — 1) + x. Let the polynomial 
r{x) = pi(x)a\(x) + P2(x)a2(x) represent an arbitrary subset 
(chosen from the 2 2 " ls ~ 1 — 1 non-empty subsets) of the first 
(2m s — 1) columns of B, for any choice of polynomials a\(x) 
and a 2 (x) with coefficients in {0,1} and maximal degrees 
(m s — 1) and (m s — 2) respectively: 

di 

di(x) — djX^, i = 1, 2, di — m s — 1, d% = m s — 2 

where ap G {0, 1} and not all a\ s are zeros. When ai(a^) 7^ 

0, let ii = deg(ai(x)). Clearly, r(x) is a monic polynomial 
of degree (m s + ix). When a\{x) = and a 2 (x) ^ 0, let 
%2 = deg(aa(a;)). Then, r(x) is a monic polynomial of degree 
(1+12). Since in both these cases r(x) is a monic polynomial, 
there is at least one coefficient equaling 1. Thus, (S)* > 2m s — 

1. Finally, notice that 

Pl (x) + x m °- x p 2 (x) = (J - 1) + x m ° + 

(J — l)x ms ~ 1 + x" 1 " 
= (J-l) + ( J - l)^" 1 + 2x m % 

with all coefficients strictly larger than 1. Note that p\(x) cor- 
responds to the first column of the protograph and x ms ~ 1 p2(x) 
to the 2mf column. Thus, we have (S)* — 2m s . Since we 
have (S(J, 2 J, m s ))* < 2m s from Proposition [5] we conclude 
that (S(J,2J,m s ))* = 2m s . 
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